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The extended utilization of picture-enhancing or manipulating tools has led 
to ease of manipulating multimedia data which includes digital images. 
These manipulations will disturb the truthfulness and lawfulness of images, 
resulting in misapprehension, and might disturb social security. The image 
forensic approach has been employed for detecting whether or not an image 
has been manipulated with the usage of positive attacks which includes 
splicing, and copy-move. This paper provides a competent tampering 
detection technique using resampling features and convolution neural 
network (CNN). In this model range spatial filtering (RSF)-CNN, 
throughout preprocessing the image is divided into consistent patches. Then, 
within every patch, the resampling features are extracted by utilizing affine 
transformation and the Laplacian operator. Then, the extracted features are 
accumulated for creating descriptors by using CNN. A wide-ranging analysis 
is performed for assessing tampering detection and tampered region 
segmentation accuracies of proposed RSF-CNN based tampering detection 
procedures considering various falsifications and post-processing attacks 
which include joint photographic expert group (JPEG) compression, scaling, 
rotations, noise additions, and more than one manipulation. From the 
achieved results, it can be visible the RSF-CNN primarily based tampering 
detection with adequately higher accurateness than existing tampering 
detection methodologies. 
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1. INTRODUCTION 


The emergence of social networks in our daily life spurs the advent of various digital image editing 
tools which leads to belief issues of multimedia content being circulated. Designing high-quality tampering 
employing a machine learning model that is visually undetectable through the human eye [1], [2] is well 
within reach of the user through the following tool such as FaceApp [3], Adobe Sensiei [4], DeepPhoto 
editor [5], and Adobe sky Replace [6]. Thus, classifying an image as tampered with or not is becoming an 
extremely difficult task. In general, tampering is classified into content preserving and changing [7]. The 
copy-clone, splicing, and object exclusion are widely used primary attacks, and blurring, compression, and 
contrast adjustment are commonly used secondary attacks. However, the primary attack will result in a 
semantical illustration of an image. Thus, this work focuses on detecting the primary attack. 
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Recently, several methods have been presented for detecting primary tampering attacks [8]-[10]. 
However, these methods have focused on detecting whether an image is tampered or not and failed to 
localize tampered region within a tampered image [11], [12]. In [13], [14] able to localize the patches that are 
tampered with through the employment of frequency domain [15], [16]. Sudiatmika et al. [17] used noisy 
information [18] of compressed image for establishing whether the image has tampered or not. In recent 
times deep learning techniques have achieved a very good result in computer vision applications [19]-[21] 
including image tampering detection and classification [22], [23]. The autoencoder [24] and convolution 
neural network (CNN) [22], [23] have been widely used to detect primary attacks such as splicing [24], [25] 
and copy-move [26]. However, these model fails to provide good accuracy when image undergoes a hybrid 
transformation as they are designed to detect either splicing or copy-move. Further, the traditional fully- 
connected CNN-based framework [27] fails to generalize different noise induced through different tampering 
detection methods; thus, poor tampering region localization outcome is achieved. 

In addressing such issues in this work presented an improved tampering detection model employing 
improved preprocessing, feature aggregation, and CNN architecture [11]. Bunk ef al. [11] showed, image 
tampering induces noise because of periodic interpolation among adjacent pixels which can be understood 
through resampling features [13], [28]. Here affine transformation and Laplacian function is used for 
extracting resampling features and descriptor is built through CNN. The significance of range spatial filtering 
(RSF)-tamper detection (TD) is described next. The paper presented a CNN-based tampering detection 
method by learning Resampling Features. The RSF-TD can extract useful features among adjacent pixels of 
both horizontal and vertical directions with better accuracy. The RSF-TD can be used for detecting tampered 
image that has undergone multiple tampering. The RSF-TD achieves very good precision, Fl-score, and 
recall performance in comparison with the recent tampering detection method. 

The paper is arranged as follows: the proposed resampling feature tampering detection method 
through CNN is conferred in section 2. The overall outcome achieved using the RSF-TD method over 
different tampering detection model are given. The last section discusses the significance of RSF-TD and 
also discusses the future direction of research work. 


2. EFFICIENT RSF AND CNN MODEL FOR IMAGE FORGERY DETECTION 

Here the tampering detection through resampling feature extraction and CNN descriptor is 
presented. For detecting tampering and segmenting tampered region efficiently the following design is 
presented in Figure 1. This RSF-TD architecture is having six steps. First, the image is segmented into 
different patches. Then, the feature is extracted using a scale-invariant descriptor for establishing the 
duplicated region even under the small and smooth region. 
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Figure 1. Methodology of proposed RSF-CNN Model for tampering detection 


2.1. Preprocessing and resampling feature detection and extraction 

Generally, the tampered images will have significant impacts on the statistical properties along the 
edges. Similar to methodologies presented in [29] in this work the resampling feature are extracted through 
affine transformation and Laplacian function. Here the image is divided into non-overlapping with patch size 
set to 64. The dimension of the patch will be 64*64 for considering an image size of 512*512. Keeping the 
size to 512*512 significantly reduce the computation time in detecting and localizing tampering region. To 
predict the linear error Laplacian function is utilized [13]. Affine transformation matrices are used for 
collecting the errors considering various directions and angels. Finally, Fourier transformation is used for 
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extracting RSF considering diverse tampering transformation. The existing method performs localization at 
pixel level; however, the RSF-TD performs localization at the pixel level. Here the patch size is set to 32*32 
and block size is kept to 8*8 for extracting local RSF with less redundancy. Once RSF is extracted they are 
trained using CNN; achieving higher accuracy depends on the order of feature aggregation. The existing 
model arranges the features in the horizontal or vertical direction; thus, poor correlation among different 
directions significantly impacts accuracy. To address the aforementioned issues in this space-filling 
curve [30]-[33] is used. The spacing filling is very effective in converting the multi-dimension problems to a 
single dimension [34]-[37]. 
2.2. Aggregation of features and decision using CNN for tampering detection 

The previous section extracts efficient RSF features; these features must be cumulated to build an 
efficient learning mechanism for the classification of the image is undergone hybrid primary attacks. 
Different aggregation methods are modeled in (1) to (4) [38]. The (1) defines the minimum aggregation 
model, 


G, pes Gj (1) 


Gy=. max. G; (2) 
The (3) defines the mean aggregation function 
3 
oe 4G; (3) 
The (4) squared mean aggregation function 


= 200 G (4) 


In (1) to (4), the parameter G; = [Gi ey Gio] defines the amount of feature extracted in the j*” 
patch and O, defines the maximum patch size used. In this work we used different pooling methods; then 
features are aggregated for eliminating spatial dependency. Let 6 defines CNN parameter, M defines loss 
function of CNN structure sued, and Gag, defines cumulated features. Then, the gradient of M concerning 0 
is defined as follows: 


=p OM OGagr,a (5) 
4 1OGagra 90 


with, 
ws * Lj js(d) max pooling (6) 
OGia : 
Se aoa: a6 Hidue min pooling 
a6 re @ abba average pooling 
J=1 90 


: oe 2Gia “ia avg.sqr pooling 


Using (6) we can state that 4, = 1 provided j = k; otherwise uj, = 0. The parameters jj(d) and j;(d) 
define the feature vector with the smallest and largest d* component. Using all pooling together assures that 
gradient weights can be optimized more efficiently; thus, aiding in achieving better tampering detection 
accuracy. Finally, the optimized RSF are aggregated for building descriptor G using two-layer fully 
connected CNN [39]; this assures higher accuracy with minimal computation time. 


2.3. Training of CNN 
The RSF-TD without the need for any fixed network can be trained end-to-end by using the whole 
image. Then, the decision is taken to classify image is tampered or not. Here loss functions are back- 
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propagated within the network of respective patches; this helps in obtaining better-correlated features in an 
adaptive nature. Thus, assures RSF-TD achieves better detection accuracy with less misclassification. 


3. RESULTS AND DISCUSSIONS 

Here experiment is conducted to study the tampering detection outcome of RSF-TD and standard 
tampering detection methods. The experiment is conducted using window 10 operating system with Intel 
quad processor and 16 GB Ram. The RSF-TD has been implemented through Matlab, C++, and the 
Anaconda python 3 frameworks. The experiment is done using two standard datasets such as media 
integration and communication center (MICC) and DO dataset because it has undergone a hybrid 
transformation attack. More detail of the dataset used for the experiment is given in Table 1. The metric used 
for evaluating the performance of RSF-TD and the existing tampering detection model [40], [41] is recall, 
Fl-score, and false-positive rate. 


Table 1. Dataset description 
Dataset Image size _ Scaling and rotation _ Compression 
MICC 600 Yes No 
DO 50 yes Yes 


3.1. Evaluation of performance on MICC dataset 

The MICC is composed of 300 genuine images and 300 images undergo a hybrid transformation 
attack. Therefore, a total of 600 images are available. On average 1.2% of patches are tampered with. Thus, it 
is extremely difficult in detecting tampering. The Figure 2(a) shows the original image, Figure 2(b) shows 
respective segmentation outcome using RSF-TD respectively. In Figure 2, the tampering region segmentation 
outcome achieved using RSF-TD is shown. Similarly, The Figure 3(a) shows the original image, Figure 3(b) 
shows respective ground truth of tampered region, segmentation outcome achieved using existing and 
RSF-TD model is shown in Figure 3(c) and Figure 3(d), respectively. Figure 3 shows the segmentation 
outcome achieved using RSF-TD and the existing segmentation model [26]. Table 2, shows the recall, false 
positive rate (FPR), and Fl-Score performance achieved using RSF-TD and the existing tampering detection 
model [40]. From the result, we can state the RSF-TD archives have much better detection accuracies under 
hybrid attack in comparison with existing tampering detection models. 


(a) 


(b) 


Figure 2. The results of the proposed RSF-CNN model; (a) original image and (b) segmentation outcome 
using RSF-TD 


3.2. Evaluation of performance on DO dataset 

In this section performance of RSF-TD and the existing tampering, detection methods are studied 
using the DO dataset. The dataset undergoes hybrid transformation such as scaling and rotation with JPEG 
compression. Thus, tampering detection makes very challenging. The Figure 4(a) shows the original image, 
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Figure 4(b) shows respective ground truth of tampered region, feature extraction and segmentation outcome 
achieved using existing and RSF-TD model is shown in Figure 4(c), and Figure 4(d), respectively. In Figure 4, 
the segmentation outcome of the tampered region using RSF-TD is shown. The detection accuracy using 
RSF-TD and the existing model on DO dataset is shown in Table 3. Through the outcome achieved we can 
state that RSF-TD performs significantly better than the existing tampering detection model in terms of 
Fl-score, FPR, precision, and recall. 


(d) 


Figure 3. Tampering region segmentation outcome using RSF-TD and existing tampering region 
segmentation model; (a) original image, (b) ground truth, (c) existing model [26], and (d) RSF-TD 


Table 2. RSF-TD and existing method tampering detection accuracy for MICC dataset 


Model Recall//TPR FPR __ Fl-Score 
Raju and Nair 2018 [40] 89.14 - 92.6 
RSF-TD 97.5 1.4 97.7 


(b) 


Figure 4. Segmentation outcome of RSF-TD model; (a) input image, (b) ground truth, (c) feature extraction, 
and (d) segmentation outcome 
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Table 3. RSF-TD and existing tampering detection model for DO dataset 


Model Recall Precision FPR Fl 
Huang and Ciou [41] 84.88 92.81 3.39 88.67 
RSF-CNN 98.08 98.84 1.68 99.28 


3.3. Computation complexity performance evaluation 

Here experiment is done to study the computation complexity of different tampering detection 
methods. The computation complexity is measured in terms of the amount of time taken to detect tampering 
regions. Figure 5 shows the computation time taken to process MICC-600 using the deep learning-based 
method [42], the hierarchical method, and RSF-TD. Similarly, Figure 6 shows the computation time taken to 
process MICC-2000 using the deep learning-based method [42] and RSF-TD. Figure 7 shows the 
computation time taken to process the CoMoFoD dataset using the DBSCAN-based method, and RSF-TD. 
From Figures 5 to 7 we can see the RSF-TD significantly reduces computation time in comparison with the 
deep learning-based method, hierarchical method, and DBSCAN-based method. 
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Figure 5. Computation time for MICC-600 Figure 6. Computation time for MICC-2000 


Computation complexity for CoMoFoD 
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Figure 7. Computation time for CoMoFoD 


4. CONCLUSION 

Here the paper emphasized using the resampling feature through effective preprocessing and CNN 
model. The RSF-TD brings in good tradeoffs between achieving higher detection accuracies with minimal 
computation time. Further, able to achieve higher detection accuracies with better segmentation outcomes in 
comparison with the standard tampering detection model. The result proves the RSF-TD can extract highly 
correlated features and eliminate spatial dependencies. The experiment conducted on two datasets with 
hybrid tampering attack transformation shows the RSF-TD achieves much better accuracies in comparison 
with the recent tampering detection model which is measured through the following metrics such as F1-score, 
FPR, TPR, precision, and recall. Despite achieving superior detection accuracies the model can be further 
improved in the future by eliminating outliers through effective design of CNN framework that is robust 
against noise. Further, validate the model using more diverse datasets. 
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